Protein Structure Similarity Measurement by Language Modeling Techniques

نویسندگان

  • Jafar Razmara
  • Safaai B. Deris
چکیده

In the era of structural biology, it is necessary to apply efficient and effective tools in order to measure structural similarity between proteins. Although a great number of structural comparison methods have been developed, none of them gives an exact solution to the problem. In this paper, we introduce a novel method for structural similarity measurement of proteins based on language modeling techniques. The roots of the method are inspired from computational linguistics and the related techniques for quantifying and comparing strings of characters. In this way, the protein structure is represented in sequences of characters and then n-gram based modeling techniques are applied to capture the content regularities. In the sequel, these regularities are contrasted by cross-entropy concept and the similarity between two protein structures is measured. To find an overlap between two protein structures in 3D-space, a superposition task is also applied. In this very first attempt, the experimental results represent the usefulness of the new approach and motivate further studies on development of tools based on computational linguistics methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computer Aided Molecular Modeling Of Membrane Metalloprotease

Molecular modeling is a set of computational techniques for construction of 3D structure of a protein especially membrane bound proteins whose structures can not be elucidated using experimental techniques. These techniques has been applied in the study of membrane metalloproteases for comparing wild and mutated enzymes, docking inhibitors in the catalytic site and examination of binding pocket...

متن کامل

In Silico Analysis of Primary Sequence and Tertiary Structure of Lepidium Draba Peroxidase

Peroxidase enzymes are vastly applicable in industry and diagnosiss. Recently, we introduced a new kind of peroxidase gene from Lepidium draba (LDP). According to protein multiple sequence alignment results, LDP had 93% similarity and 88.96% identity with horseradish peroxidase C1A (HRP C1A). In the current study we employed in silico tools to determine, to which group of peroxidase enzymes LDP...

متن کامل

A novel method for detecting structural damage based on data-driven and similarity-based techniques under environmental and operational changes

The applications of time series modeling and statistical similarity methods to structural health monitoring (SHM) provide promising and capable approaches to structural damage detection. The main aim of this article is to propose an efficient univariate similarity method named as Kullback similarity (KS) for identifying the location of damage and estimating the level of damage severity. An impr...

متن کامل

Isoelectric Focusing and PCR-RFLP Joined Techniques for Alpha1-antitrypsin Deficiency Detection

53 persons suspected to alpha1-antitrypsin deficiency detection (AATD) were investigated for ZZ, MZ, ZS, SS, and MS alleles analysis by serum protein electrophoresis (SPE), measurement of trypsin inhibiting capacity (TIC), isoelectric focusing (IEF), polymerase chain reaction (PCR), and IEF/PCR-RFLP techniques. The result clearly shows by using SPE and TIC techniques only 35.85 % and 50.08% of ...

متن کامل

Effects of T208E activating mutation on MARK2 protein structure and dynamics: Modeling and simulation

Microtubule Affinity-Regulating Kinase 2 (MARK2) protein has a substantial role in regulation of vital cellular processes like induction of polarity, regulation of cell junctions, cytoskeleton structure and cell differentiation. The abnormal function of this protein has been associated with a number of pathological conditions like Alzheimer disease, autism, several carcinomas and development of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009